Discovering Company Descriptions on the Web by Multiway Analysis

نویسندگان

  • Vojtech Svátek
  • Petr Berka
  • Martin Kavalec
  • Jirí Kosek
  • Vladimír Vávra
چکیده

We investigate the possibility of web information discovery and extraction by means of a modular architecture analysing separately the multiple forms of information presentation, such as free text, structured text, URLs and hyperlinks, by independent knowledge-based modules. First experiments in discovering a relatively easy target, general company descriptions, suggests that web information can be efficiently retrieved in this way. Thanks to the separation of data types, individual knowledge bases can be much simpler than those used in information extraction over unified representations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discovering the Dimensions of Emotional Climate of Organization in National Iranian Oil Products Distribution Company

This study aimed to explore the dimensions of emotional climate of organization in National Iranian Oil Products Distribution Company (NIOPDC). The statistical population consisted of all employees of NIOPDC in Khorasan Razavi District, and the sample was selected using purposeful sampling method up to the theoretical saturation with the sampling adequacy of 19 people. Data were collected throu...

متن کامل

A relation between the theory of formal concepts and multiway clustering

Contexts where entity descriptions belong to a meet-semilattice are considered. When the entity set is finite, we show that nonempty extensions of concepts assigned to such contexts coincide, casewise, with strong or weak clusters associated with some pairwise or multiway dissimilarity measure. Moreover, by duality principle, a similar result holds when entity descriptions belong to a join-semi...

متن کامل

Web-based Information for Medical Tourism: Case Study of AriaMedTour Medical Tourism Company, Iran

Objective: As one of the well-known countries for medical tourism, Iran has the potential for growth in this industry and requires information and advertisements in online media and websites. This study aims to investigate the effectiveness of the content produced by the website of AriaMedTour Medical Tourism Company in informing tourists. Methods: This is an applied study that adopted an indu...

متن کامل

Positioning of Industries in Cyberspace Evaluation of Web Sites Using Correspondence Analysis

  In today’s extremely competitive markets it is crucial for companies to strategically position their brands, products and services relative to their competitors. With the emerging trend in internationalization of companies especially SME’s and the growing use of the Internet with this regard, great amount of attention has been turned to effective involvement of the Internet channel in the mar...

متن کامل

Focused Crawling of the Deep Web Using Service Class Descriptions

Dynamic Web data sources—sometimes known collectively as the Deep Web—increase the utility of the Web by providing intuitive access to data repositories anywhere that Web access is available. Deep Web services provide access to real-time information, like entertainment event listings, or present a Web interface to large databases or other data repositories. Recent studies suggest that the size ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003